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ABSTRACT 


Comparative sequencing of the maturase-encoding chloroplast gene matK has great potential for reconstructing 
phylogenetic relationships not only within families, but also within genera of land plants. This gene of 1550 bp is 
easily amplified due to highly conserved, flanking coding regions that include the trnK exons, грѕ16, and psbA. 
Several available sequencing primers also have wide applicability. Parsimony analysis of 45 matK sequences repre- 
senting Saxifragaceae sensu stricto provides a level of resolution comparable to that obtained via chloroplast DNA 
restriction site analysis. Furthermore, this analysis suggests relationships among genera and species that are highly 
concordant with the results of separate analyses of rbcL sequences and chloroplast DNA restriction sites, and with 
those of combined analyses of these three chloroplast DNA data sets. Parsimony analysis of 31 matK sequences 
representing all six sections of Gilia (Polemoniaceae) and 10 allied genera provides strong evidence for the polyphyly 
of Gilia and suggests relationships among sections of Gilia that are highly concordant with a recent ITS sequence 
analysis of the Polemoniaceae. Our analyses suggest that matK sequences are not strongly biased toward transitions, 
and the frequency of mutations at the first and second codon positions approach the frequency of mutations in the 


third codon position. 





Investigation of the chloroplast genome, either 
through analysis of restriction site mutations, struc- 
tural rearrangements, or DNA sequences, has dom- 
inated plant molecular systematic research during 
the past decade. These approaches have proven 


extremely useful in addressing a broad range of 


systematic and evolutionary questions at all levels 
of taxonomic hierarchy. Of these approaches, com- 
parative sequencing of chloroplast, as well as nu- 
clear, genes has become particularly popular in 
recent years, due in large part to the relative ease 
of generating sequences and the unambiguity of 
the data. The large number of recent systematic 
studies employing sequencing of the chloroplast 
gene rbcL attests to the enormous phylogenetic 
potential of comparative sequencing (e.g., Bruns- 
feld et al., 1994; Chase et al., 1993; Conti et al., 
1993; Donoghue et al., 1992; Gadek & Quinn, 
1993; Giannasi et al., 1992; Kim et al., 1992; 
Kron & Chase, 1993; Olmstead et al., 1993; Mor- 
gan & Soltis, 1993; Price & Palmer, 1993; Qiu 
et al., 1993; Rodman et al., 1993; Smith et al., 


1993; Soltis et al., 1990). Although rbcL sequence 
analysis has overshadowed the use of other gene 
sequences in plant systematics, the phylogenetic 
utility of several other DNA regions, both nuclear 
and organellar, has been investigated in plants. 
Among chloroplast genes, atpB (Ritland & Clegg, 
1987; Hoot et al., 1995 this issue), matK (Johnson 
& Soltis, 1994; Steele & Vilgalys, 1994), and 
ndhF (Olmstead & Sweere, 1994; Olmstead & 
Reeves, 1995, this issue) provide regions with dem- 
onstrated utility for inferring phylogenies at tax- 
onomic levels unresolvable with rbcL sequences 
alone. The use of these, and other regions of chlo- 
roplast DNA (cpDNA), such as noncoding inter- 
genic spacers (Gielly & Taberlet, 1994; Golenberg 
et al., 1993), for phylogenetic inference suggests 
that analysis of the chloroplast genome will con- 
tinue to provide important information for system- 
atics. 

Among protein coding regions in the chloroplast 
genome, matK (ORFK) is one of the most rapidly 
evolving (Wolfe, 1991). The chloroplast gene matK 
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is approximately 1550 base pairs (bp) in length 
and encodes a maturase involved in splicing type 
II introns from RNA transcripts (Neuhaus & Link, 
1987; Wolfe et al., 1992). In all photosynthetic 
land plants examined to date, matK is located 
within an intron of approximately 2600 bp located 
between the 5' and 3' exons of the transfer RNA 
gene for lysine, trnK (Fig. 1). However, in the 
reduced chloroplast genome of the nonphotosyn- 
thetic parasite Epifagus virginiana (L.) Barton 
(Orobanchaceae), neither the trnK coding regions 
nor associated intron are present (Wolfe et al., 
1992). In Epifagus, matK is bounded by trnQ 
and the psbA pseudogene. The presence of matK 
in the very reduced plastid genome of Epifagus 
suggests that matK is functionally important and 
has a broader intron splicing role than simply splic- 
ing the ¢rnK intron in which it typically occurs 
(Wolfe et al., 1992). Alignment of seven complete 
matK amino acid sequences representing bryo- 
phytes, gymnosperms, monocots, and dicots re- 
veals the presence of more conserved regions in- 
terrupted by stretches of sequence having little 
similarity (C. Morden and J. Palmer, pers. comm.). 
We recently reported that matK sequences of 
only 750 bp provided resolution of relationships in 
Saxifragaceae sensu stricto (s. s.) comparable to 
that obtained via restriction site analysis of the 
entire chloroplast genome based on 20 endonu- 
cleases; these matK sequences provided greater 
resolution than did rbcL sequences (Johnson & 
Soltis, 1994). Herein we explore further the phy- 
logenetic utility of та sequence variation 
through: (1) phylogenetic analyses of an expanded 
data set for Saxifragaceae s. s., a preliminary data 
set for СШа (Polemoniaceae), and analysis of the 
polyploid origin of Saxifraga osloensis Knaben; 
(2) description of matK sequence variation; and 
(3) discussion of the broad applicability of matK 
sequencing primers in various plant groups. 
Saxifragaceae s. s. continue to provide an ideal 
opportunity for assessing the utility of other chlo- 
roplast data sets because the family is well defined 
by molecular data, and both rbcL sequences and 
cpDNA restriction sites have been previously gath- 
ered for virtually all genera (e.g., Soltis et al., 1991; 
Soltis et al., 1993; Morgan & Soltis, 1993). We 
have expanded our matK sequence matrix for Sax- 
ifragaceae s. s. from that reported by Johnson & 
Soltis (1994) to include an additional 324 bp and 
14 additional species yielding 1078 bp of sequence 
data for 45 taxa. Additional species of Saxifraga 
and Chrysosplenium have been included because 
these genera possessed the longest branch lengths 
in our previous study (Johnson & Soltis, 1994). 


Additional members of the Heuchera and Boykinia 
groups of genera were also included to explore more 
fully the degree of resolution obtainable with matK 
sequences in these well-supported groups. We pres- 
ent analyses of matK data alone, as well as the 
results of combined analyses involving matK and 
rbcL sequences and cpDNA restriction sites (Soltis 
et al., 1993) to provide a comprehensive view of 
relationships in Saxifragaceae s. s. as suggested by 
these three cpDNA data sets. We also discuss the 
fine-scale resolution that matK can provide using 
an example of polyploid evolution in Saxifraga. 

In addition to Saxifragaceae s. s., we also illus- 
trate the phylogenetic utility of matK sequences 
using analyses of a preliminary data set for Gilia 
(Polemoniaceae). Steele & Vilgalys (1994) pre- 
sented an analysis using matK sequences to resolve 
relationships among genera of Polemoniaceae, but 
they did not address intrageneric relationships with- 
in any of these genera. Gilia is a large, morpho- 
logically diverse genus of questionable monophyly 
and affinities. In many respects, the approximately 
70 species currently recognized in Gilia (Day, 
1993a) are united more by the lack of synapo- 
morphic characters that circumscribe any of the 
other 12 to 14 temperate genera rather than by 
any consistent set of characters unique to them- 
selves. We compare our phylogenetic trees for 
Gilia based on matK sequences to those obtained 
in a recent analysis of sequences obtained from the 
nuclear ribosomal DNA internal transcribed spacer 
regions (rDNA-ITS) that included this genus (Por- 
ter, 1993). 

We describe the nature of sequence variation 
in matK, including: (1) comparison of nucleotide 
variability between matK and rbcL, ndhF, and the 
ITS regions; (2) comparison of transition : trans- 
version ratios and substitution rates by codon po- 
sition between matK and rbcL; (3) determination 
of the degree of random structuring of variation 
(Archie, 19892); and (4) determination of the num- 
ber and phylogenetic distribution of insertion-de- 
letion events. 

Lastly, in an effort to promote the use of matK 
by other investigators, we also discuss the appli- 
cability of PCR and sequencing primers in various 
taxonomic groups. 


MATERIALS AND METHODS 
AMPLIFICATION AND SEQUENCING 


DNA was isolated from all taxa (Appendix 1) 
using a CTAB buffer method (Doyle & Doyle, 
1987) as modified by Soltis et al. (1991). Ampli- 
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Relative location of amplification and sequencing primers used іп this study for sequencing таг К and 


the trn K intron. Numbers refer to primers listed in Table 1. The shaded region above matK indicates the region 
sequenced for species included in the Saxifragaceae s. s. and Са data matrices. 


fication of matK was accomplished via the poly- 
merase chain reaction (PCR) to obtain sufficient 
quantities of DNA for sequencing. Several syn- 
thetic PCR primers located in the 5'trnK and 3'trnK 
exons, as well as in the rpsl6 and psbA genes 
that flank trn К (and thus matK; Fig. 1, Table 1), 
have been designed. Two PCR primer pairs (rps 1 6- 
4547F and trn K-2R; and trnK-3914F and psbA- 
R) provide DNA amplification products for initial 
sequencing of the entire trn K intron using tri K- 
3914F and trnK-2R, respectively, as sequencing 
primers (e.g., Johnson & Soltis, 1994). In this 
fashion, we sequenced the entire /гпК intron in 
Bensoniella oregona, Saxifraga integrifolia, and 
штапа sullivantii (а complete тако se- 
quence, excluding the flanking trn К. intron regions, 
was also generated for Sullivantia oregana). For 
all other taxa, we used trn K-3914F and trn K-2R 


to produce double-stranded DNA from total ge- 
nomic DNA. We subsequently used this double- 
stranded DNA as a template to produce single- 
stranded DNAs using primer trn K-3914F individ- 
ually for forward strand synthesis, and primer trn K- 
2K for reverse strand synthesis. All PCR reactions 
used Replitherm polymerase from Epicentre Tech- 
nologies following the manufacturer's suggested 
concentrations of all reagents and DNA. The PCR 
temperature profile we employed consisted of 30 
cycles at 94°C for 1 minute 30 seconds, 48°C for 
2 minutes, and 72°C for 3 minutes with an addi- 
tional 15 minutes at 72°C following the final cycle. 
Secondary bands were occasionally observed in 
agarose test gels of our single-stranded DNA am- 
plification products but only rarely in our double- 
stranded DNA products. These secondary bands 
never posed any problems during sequencing. 


TABLE 1. Base composition of amplification and sequencing primers discussed in this study. Primers rps16-4547F 
and trnK-3914F (monocot) were designed by Jerry Learn. We later modified trnK-3914F (monocot) to produce 
trnK-3914F (dicot) and hence provide greater homology to dicots. Primers trnK-2R and psbA-R were designed by 
Kelly Steele. All other primers were designed in our lab and named based on their approximate position on the trnK 


map for Sinapis (Neuhaus & Link, 1987). Primer reference numbers correspond to those used in Figure 1. 





Reference 

number in 

Figure 1 Name 5' Sequence 3' 
1 rpsl6-4547F AGG TGC TCA ACC ТАС ААС ААС C 
2 irn K-3914F (dicot) GGG СТТ GCT AAC TCA ACG С 
2 trnK-3914F (monocot) ATC TGG GTT GCT AAC TCA ATG G 
8 trnK-253F TTG GGT CGA GTC AAT AAA T 
4 trnK-582F СТА АСС АТС TTG TTA ТСС T 
5 trnK-710F GTA TCG CAC TAT GT[T/AJ TCA TTT GA 
6 matK-934F ATT. TTG СТТ АТС ACA АТА А 
7 matK-1168R ATT GAA TGA ATT GAT CGT A 
8 matK-1176F САА. TTC ATT CA[A/C] TAT TTG CIT 
9 matK-1235R СОГАР СТС ССС TAT TAG TAT A 
10 matkK-1412F ATA TAA TTC TTA TGT ATG TG 
10 matK-1412R САС АТА [С/САТ ААС AAT TAT AT 
11 matK-1470R ААС АТС TTG AT[T/C] СТА AAT СА 
12 matK-1506R ТТС ‘CAT АСА AAT ATA TTC С 
13 matK-1848R TAT ССА. ACT TCT TAA ТАС СЄ 
14 matK-2000R АТТ TET GCA. ТАТ GCG CAC ААА TG 
15 matK-2200R TCI СТА ТАА CECT GCA САА AG 
16 trnK-2R ААС TAG ТСС GAT ССА СТА С 


17 psbA-R ССС. СТС. ТСТ СТА. AAA TTG CAG TCA T 
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Following PCR amplification, single-stranded 
DNAs were precipitated with 20% PEG/2.5M 
NaCl, washed in 70% and 95% EtOH, dried, and 
resuspended in TE (Morgan & Soltis, 1993). We 
subsequently employed dideoxy sequencing of the 
resuspended PCR products using the Sequenase 
2.0 kit (U. S. Biochemical Corp.) and a set of 
sequencing primers (primers matK-1168R ог 
matK-1235R, matK-1470R, and matK-1412F; 
or matK-1168R, matK-1506R, and matK-1848R; 
Fig. 1; Table 1). These primers enable us to se- 
quence routinely over two-thirds of matK, begin- 
ning at the 5' end (Fig. 1). Approximately 500 bp 
at the 3' end were not sequenced because we have 
obtained sufficient resolution to address our present 
inquiries with the approximately 1080 bp obtained 
using the sequencing primers described above. 


PLANT SAMPLES 


Saxifragaceae s. 5. In addition to extending 
our previously reported matK sequences (Johnson 
& Soltis, 1994) by 324 bp, we sequenced 14 new 
species for a total of 45 sequences representing 
virtually all of Saxifragaceae s. s. (Appendix 1). 
Despite repeated attempts, we have been unable 
to obtain suitable material for DNA isolation from 
the monotypic genera Hieronymusia, Saxifragel- 
la, and Zahlbrucknera due to their geographically 
remote and restricted distributions. Oresitrophe is 
also missing from this analysis because we were 
not able to obtain material from this monotypic 
genus until after the lengthy phylogenetic analyses 
of Saxifragaceae s. s. were completed. Tetracar- 
paea and Ribes (Grossulariaceae) were included 
as outgroups because previous studies have shown 
these genera to be close relatives of Saxifragaceae 
s. s. (Morgan & Soltis, 1993; Soltis et al., 1993). 
Partial sequences of Saxifraga osloensis, S. tri- 
dactylites, and S. adscendens have also been ob- 
tained and are discussed below as an example of 
the insights matK sequences can provide regarding 
polyploid origins; these sequences were not, how- 
ever, included in our broad parsimony analyses of 
Saxifragaceae s. s. 

Partial matK sequences for Saxifragaceae s. s. 
vary from 1039 to 1063 bp in length and provide 
a matrix of 1078 characters after alignment. Ten 
insertion-deletion events (indels) of three, six, or 
nine nucleotides distributed among 18 species ac- 
count for the length variation in these sequences 
(Appendix 2). We easily aligned these sequences 
visually and positioned indels so as to minimize 
base substitutions while maintaining the proper 
reading of codons. After alignment, we checked 
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the reading frame and position of indels by trans- 
lating the sequences to amino acids using either 
MacClade version 3.01 (Maddison & Maddison, 
1992) or MEGA version 1.01 (Kumar et al., 1993). 
We scored missing bases associated with indels as 
ambiguous ('*?"), rather than as a fifth character- 
state, and considered the phylogenetic distribution 
of each indel a posteriori by mapping its occurrence 
on trees derived from analysis of base substitutions 
alone. 

These matK sequences are the third chloroplast 
DNA data set constructed for the purpose of re- 
solving relationships within Saxifragaceae s. s. A 
combined analysis of the other two data sets, cpDNA 
restriction sites and rbcL sequences, has recently 
been reported (Soltis et al., 1993). Here we com- 
pare and also combine our matK sequences with 
these two other DNA data sets to obtain a com- 
prehensive view of relationships suggested by 
cpDNA data in Saxifragaceae s. s. Because slight 
differences exist in the taxa sampled for the three 
molecular analyses (cpDNA restriction sites and 
matK and rbcL sequences), we constructed three 
different combined matrices (Appendix 1) in an 
effort to obtain the most comprehensive combined 
analysis possible. Matrix-l is the "purest" com- 
bined data set comprising 21 species for which all 
three character sets exist. Matrix-2 comprises all 
of the taxa from matrix-1 with the addition of 
Astilbe and Chrysosplenium, genera for which 
different species were analyzed in the various DNA 
studies. A complete data set for Astilbe was formed 
by combining the rbcL sequence and restriction 
site data from 4. taquetii with the matK sequence 
from A. japonica X chinesensis. Similarly, a com- 
plete data set for Chrysosplenium was formed by 
combining matK and rbcL sequences from C. iow- 
ense with restriction site data from C. americanum. 
This approach seemed reasonable given that both 
Astilbe and Chrysosplenium are distinctive, well. 
defined genera in Saxifragaceae. We thus feel the 
likelihood of conflict among the character sets for 
these genera is not substantially greater than that 
existing among character sets for any of the other 
taxa for which a single species was used to generate 
all three data sets. Matrix-3 contains the broadest 
sampling of saxifragaceous genera, including all of 
the taxa in matrix-l and an additional 22 taxa for 
which any two of the three character sets were 
available. For example, Suksdorfia violacea is in- 
cluded in matrix-3 because both cpDNA restriction 
site and matK sequence data are available but not 
rbcL sequence data. Similarly, Astilbe and Chry- 
sosplenium are represented in this matrix only by 
the character sets obtained from А. taquetii (cpDNA 
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restriction site and rbcL sequence data) and C. 
iowense (matK and rbcL sequence data), rather 
than by the composite data set used in matrix-2. 
We filled the missing character set for each of the 
1] taxa for which a character set is missing with 
**?" to indicate ambiguity for all missing charac- 
ters. 


Gilia. Сша comprises approximately 70 spe- 
cies in six sections (sensu Day, 1993a, b). Although 
we are interested primarily in intersectional rela- 
tionships involving sections Saltugilia and Kellog- 
gia, Gilia has served as the catch-all among tem- 
perate Polemoniaceae (Grant, 1959; Wherry, 
1940), and the matK sequence results (see below) 
underscore the necessity of broad sampling among 
allied genera to gain the clearest picture of rela- 
tionships in Gilia. Our preliminary data matrix for 
Gilia includes 31 species representing all six sec- 
tions of Gilia (sensu Day, 1993a, b) and eight 
allied temperate genera (Allophyllum, Collomia, 
Eriastrum, Ipomopsis, Langloisia, Navarretia, 
Phlox, and Polemonium; Appendix 1). We se- 
lected the tropical genera Bonplandia and Cantua 
as outgroups because both traditional (e.g., Grant, 
1959) and molecular data (Porter, 1993; Steele 
& Vilgalys, 1994) suggest these genera are close 
allies of the temperate Polemoniaceae. 

The matK sequences generated for Gilia vary 
from 1065 to 1080 bp in length and provide a 
matrix of 1083 characters after alignment. Four 
indels of three, six, or nine nucleotides in length 
distributed among eight taxa account for the length 
variation in these sequences (Appendix 2). As with 
the Saxifragaceae, we scored all missing bases as 
ambiguous (“?”) and considered the phylogenetic 
distribution of indels after parsimony analysis of 
the base substitutions alone. 


PHYLOGENETIC ANALYSES 


PAUP (version 3.1.1, Swofford, 1991) installed 
on a Macintosh Centris computer was used to search 
for most parsimonious trees, perform bootstrap 
analyses (Felsenstein, 1985; see also Felsenstein 
& Kishino, 1993; Hillis & Bull, 1993) and decay 
analyses (Bremer, 1988; Donoghue et al., 1992), 
and to calculate the retention index (RI, Farris 
1989a, b; see also Archie, 1989b, 1990) and con- 
sistency index (CI, Kluge & Farris, 1969) with 
autapomorphies retained. Parsimony analyses em- 
ployed heuristic searches using TBR branch swap- 
ping, MULPARS, HOLD = 5 and SIMPLE ad- 
dition. Five hundred replications of RANDOM 
addition with TBR branch swapping and MUL- 
PARS were also employed to search for multiple 
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islands of most parsimonious trees (Maddison, 
1991). Bootstrap analyses for each data set con- 
sisted of 100 replications using heuristic searches 
with TBR branch swapping, SIMPLE addition, and 
HOLD - 5. For our Saxifragaceae s. s. data set 
of matK sequences, we additionally set a MAX- 
TREE limit of 1000 trees per replicate after the 
computer ran out of memory on its 38th replication 
in an earlier analysis. Because few of the repli- 
cations reached this limit and we used the bootstrap 
as only one estimate of support for monophyletic 
groups, we do not feel this restriction compromises 
the integrity of our results. Decay analyses were 
performed for all data sets using the following two 
approaches. For all data sets except the matK 
sequence matrix for Saxifragaceae s. s. and the 
combined cpDNA data matrix-3, TBR branch 
swapping and 100 RANDOM addition heuristic 
searches that saved all trees up to five steps longer 
than the most parsimonious tree length were per- 
formed. Strict consensus trees were formed from 
these trees for each length greater than the most 
parsimonious tree length after filtering out trees of 
inappropriate length. This approach was inade- 
quate, however, for the matK sequence matrix and 
the combined cpDNA data matrix-3 for Saxifra- 
gaceae s. s. because of computer memory con- 
straints and the large number of trees even a few 
steps greater than the most parsimonious tree length. 
For these data matrices, the decay analysis was 
conducted successively for each step greater than 
the most parsimonious tree length for up to five 
steps by saving only trees from heuristic searches 
that failed to satisfy a constraint topology imposed 
by the strict consensus of all trees found in the 
previous steps (Johnson & Soltis, 1994; Morgan 
et al., in press; Soltis & Kuzoff, in press). 


VARIABILITY ANALYSES 


Comparisons between genes. To determine 
the distribution of base substitutions and relative 
variability of regions within matK, we used the 
IBM-PC program MEGA to make pairwise com- 
parisons of nucleotide differences per site in com- 
plete matK and rbcL sequences from three saxi- 
fragaceous taxa: Bensoniella oregona, Saxifraga 
integrifolia, and Sullivantia oregana. Pairwise 
comparisons of nucleotide differences and amino 
acid differences were also made among these three 
species and an additional 22 saxifragaceous taxa 
for which rbcL and over two-thirds of matK have 
been sequenced. Separate parsimony analyses of 
these 25 matK and rbcL DNA sequences were 
performed to estimate homoplasy as indicated by 
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CI and RI and to compare transition : transversion 
and codon position ratios as calculated by Mac- 
Clade over a most parsimonious tree for each data 
set. 

The increasing availability of DNA sequences 
from regions other than rbcL also enabled us to 
conduct two additional small-scale comparisons be- 
tween matK and other DNA sequences. In one 
such analysis, we made pairwise comparisons of 
nucleotide substitutions per site between matK and 
ndhF for Coriandrum (Apiaceae), Griselinia (Cor- 
naceae), and Hedera (Araliaceae) using ndhF se- 
quences kindly provided by Robert Jansen. A sim- 
ilar comparison was made between matK and ITS 
sequences (ITS-1 and ITS-2 combined) for Gilia 
leptalea, G. scopulorum, and G. splendens. 


Variation in Saxifragaceae s. s. and Gilia 
matK matrices. We calculated the transition: 
transversion and codon position ratios as recon- 
structed over a most parsimonious tree for both 
the Saxifragaceae s. s. and Gilia matK sequence 
matrices. The consistency indices for the Saxifra- 
gaceae s. s. and Gilia matK data sets were also 
calculated and compared to expected CI values 
derived from the regression equation of Sanderson 
& Donoghue (1989) after first removing all au- 
tapomorphies from each data set with the aid of a 
spreadsheet program. 

To assess how the observed variation in our 
Saxifragaceae s. s. and Gilia matK matrices is 
structured, we employed the randomization test of 
Archie (1989a; see also Kallersjo et al., 1992). 
The length of the most parsimonious trees from 
analyses of both data sets was compared to the 
distribution of most parsimonious trees derived from 
analyses of 100 data sets created by randomly 
permuting character states within characters via 
the SHUFFLE option in MacClade. Analyses of 
each randomized data set consisted of two heuristic 
searches employing first SIMPLE addition, and 
then CLOSEST addition, HOLD = 5, MULPARS, 
ТВК branch swapping, and MAXTREES = 50. 
The shorter of the two results was used as the most 
parsimonious tree length. This approach provided 
sufficient speed for determination of non-random 
structure in our large data sets and was more likely 
(based on our initial tests) to obtain a shorter av- 


erage tree length from the sample of 100 random- 
ized data sets than an approach using thorough 
(i.e., no MAXTREE limit) searches employing only 
SIMPLE or only CLOSEST addition. The average 
length of the most parsimonious trees from these 
randomized data sets was also used to calculate the 
homoplasy excess ratio (HER; Archie, 1989b) for 


both Saxifragaceae s. s. and Gilia. 


RESULTS 
PHYLOGENETIC ANALYSES 


In analyses of all matrices, bootstrap and decay 
values generally agree in indicating support for 
monophyletic groups. Decay values 2 5 are rarely 
found on branches with bootstrap support of less 
than 9546, and bootstrap values >95% are rarely 
found on branches with decay values <3 (Figs. 2- 


6). 


Saxifragaceae s. s. Parsimony analysis of 
matK sequences for Saxifragaceae s. s. resulted in 
two islands with a total of 372 trees of 842 steps 
(Fig. 2). The consistency index excluding auta- 
pomorphies for these trees is 0.583, a higher value 
than that expected for 45 taxa using the regression 
equation of Sanderson & Donoghue (1989; Table 
2). The large number of most parsimonious trees 
is primarily the result of homoplasy among the 
small number of substitutions supporting the basal 
branches. Increased homoplasy caused by the in- 
clusion of a few highly divergent taxa also con- 
tributes to the large number of most parsimonious 
trees obtained. For example, although Saxifraga 
mertensiana is united with 5. cernua and S. op- 
positifolia in all most parsimonious trees and the 
three taxa share 35 base substitutions (Fig. 2), 
removing 5. mertensiana prior to parsimony anal- 
ysis decreases the number of most parsimonious 
trees to 126. 

Parsimony analyses of the combined cpDNA 
restriction site and matK and rbcL sequence data 
sets for Saxifragaceae s. s. all yielded single islands 
of most parsimonious trees. Matrix-1 yielded two 
most parsimonious trees of 988 steps with a CI of 
0.801 (autapomorphies retained; Fig. 3). Matrix-2 
yielded a single most parsimonious tree of 1137 
steps with a CI of 0.764 (autapomorphies retained; 





FIGURE 2. 


— 


One of 372 most parsimonious trees from analysis of matK sequences for Saxifragaceae s. s. Base 


substitutions (ACCTRAN) are indicated above branches. Bootstrap and decay values are indicated below branches, 
respectively. Dashed lines represent branches that are not supported by all most parsimonious trees; in these instances 
the decay value is zero and is not indicated following the bootstrap value below the branch. Letters (A-J) denote the 


distribution of specific indels referenced in Appendix 2. 
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Fig. 4). Matrix-3 yielded 24 most parsimonious 
trees of 1606 steps with a CI of 0.727 (autapo- 
morphies retained; Fig. 5). Our analyses of these 
three matrices all demonstrated the presence of 
the same well-supported groups of genera (1.е., the 
Boykinia group, the Darmera group, the Неи- 
chera group, and Leptarrhena/ Tanakaea), groups 
also found in independent analyses of matK se- 
quences (Fig. 2) and rbcL sequences and cpDNA 
restriction sites (Soltis et al., 1993). In contrast, 
relationships among these well-supported groups of 
genera are poorly resolved, and there is consid- 
erable disagreement in relationships а! basal 
branches among the shortest trees resulting from 
each of these three combined matrices (Figs. 3- 
5). 
Gilia. 
for Gilia yielded a single island of 36 most par- 
simonious trees each of 418 steps (Fig. 6). The 


consistency index excluding autapomorphies for 
these trees is 0.688, which is higher than the 


Parsimony analysis of matK sequences 


expected value for 31 taxa (Sanderson & Dono- 


ghue, 1989; Table 2). 


VARIABILITY ANALYSES 


Comparisons between genes. The average 
number of nucleotide differences per site in pairwise 
comparisons of entire (1518-1521 bp) matK se- 
quences for Bensoniella oregona, Saxifraga in- 
tegrifolia, and Sullivantia oregana is 0.068. This 
value is 3.2 times greater than the average 0.021 
nucleotide differences per site observed in pairwise 
comparisons of entire rbcL sequences for these 
same species. Comparisons of partial (747 845 
bp) matK sequences with rbcL (1392 bp) among 
gymnosperms (Pinus- Cunninghamia, Cunning- 
hamia-W'iddringtonia, Widdringtonia- Junip- 
erus, and Juniperus- Microbiota) similarly reveals 
an average 3.4 times greater level of nucleotide 
differences per site in matK than in rbcL (P. Gadek 
& C. Quinn, pers. comm.). 

Extending pairwise comparisons of nucleotide 
variability to 25 saxifragaceous taxa for which 
1078 bp of matK and all of rbcL has been se- 
quenced gives an average 3.1-fold greater number 


of nucleotide differences per site in matK as com- 


pared to rbcL. The lowest ratio (1.2 : 1) of nucle- 
otide differences per site in this comparison oc- 
curred in two pairwise comparisons between 
members of the Heuchera group of genera (KL 
mera- ТеШта and Elmera - Heuchera) that are only 
weakly differentiated by sequence data (Figs. 2 and 
5). The highest ratio (6.2 : 1) of matK to rbcL 
nucleotide differences per site also occurred within 
a single group of genera, the Boykinia group, 
between the well-differentiated genera Bolandra 
and Jepsonia (Figs. 2 and 5). When the percentage 
of base positions that are variable is compared 
across these 25 taxa, matK has 3.2 times as many 
variable base positions and 2.7 times as many po- 
tentially informative characters as does rbcL (Table 
3). The greater level of variation in matK as com- 
pared to rbcL also extends to amino acids. Whereas 
5% of amino acid positions are variable in rbcL, 
in matK the figure is 59% (Table 3, Fig. 7). Al- 
though regions of high variability and regions that 
are more conserved are apparent in matK se- 
quences, variable sites appear to be fairly uniform 
in distribution throughout the 5' portion of matK 
that we have sequenced (Fig. 7). Character-state 
reconstructions over a most parsimonious tree for 
the matK and rbcL data sets reveal that the vari- 
ability in nucleotide substitutions is partitioned much 
more evenly in matK with regard to transition : 
transversion and codon position ratios compared 
to rbcL (Table 3). 

The average number of nucleotide differences 
per site in pairwise comparisons of matK sequences 
(two-thirds complete) from Coriandrum, Griseli- 
nia, and Hedera are 1.3-fold greater than the 
average number observed in comparisons of entire 
ndhF sequences for these same species. The in- 
formation content per gene is comparable between 
ndhF and так, however, given that ndAF is 
approximately 1.4 times as long as complete matK 
sequences. In contrast, pairwise comparisons be- 
tween matK sequences from Gilia leptalea, G. 
scopulorum, and G. splendens and sequences from 
the combined nuclear 175-1 and ITS-2 regions 
indicate an average 1.9-fold greater level of nu- 
cleotide differences per site in the ITS regions than 
in matK. The information content of matK is much 
greater than that of ITS, however, given that matK 





—— 


FiGURE 5. 


One of 24 most parsimonious trees for Saxifragaceae s. s. resulting from analysis of combined matK 


and rbcL sequences and cpDNA restriction site data for 43 taxa (combined matrix-3). Base substitutions (ACCTRAN) 
are indicated above branches and are partitioned by data set in the following order: mat K-rbcL-cpDNA restriction 
sites. Bootstrap and decay values are indicated below branches, respectively. Dashed lines represent branches that 
are not supported by all most parsimonious trees; in these instances the decay value is zero and is not indicated 


following the bootstrap value below the branch. 
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TABLE 2. 
gaceae s.s. (45 taxa) and Сша (31 taxa). 


Measure 


Number of characters 

Percent of characters variable 

Percent of characters potentially informative 

Consistency index (CI) 

CI excluding autapomorphies 

Expected CI excluding autapomorphies (Sanderson & 
Donoghue, 1989) 

Retention index (RI) 

Homoplasy excess ratio (HER) 

Transition : transversion ratio 

Codon position ratio 


is approximately 3.1 times longer than the ITS 
regions combined. 


Variation in the noncoding trnK intron. То 
date, phylogenetic analyses using the trn К region 
have concentrated on matK; however, the trn К 
intron regions flanking matK (Fig. 1) may also 
have phylogenetic potential. Although we have not 
used the intron regions between the 5' trn K exon 
and matK (5' intron region) and matK and the 3' 
trnK exon (3' intron region) for phylogenetic anal- 
yses, we have sequenced the major portion of these 
regions for Bensoniella oregona, Saxifraga in- 
tegrifolia, and Sullivantia sullivantii. The 5' in- 
tron region is approximately 720 bp with an av- 
erage 0.059 nucleotide differences per site in 
pairwise comparisons of these species. The 3' re- 
gion is approximately 200 bp with an average 
0.116 nucleotide differences per site. For proper 
alignment of these three sequences, each region 
required the insertion of five gaps of one to nine 
bp. The observed variability in these regions flank- 
ing matK is slightly less than matK itself for the 
5' flanking region but considerably higher for the 
3' flanking region. Both regions may provide ad- 
ditional, useful phylogenetic information. 


Variation in Saxifragaceae s. s. and Gilia 


matK matrices. Аз reconstructed over one of 
their most parsimonious trees (Figs. 2 and 6, re- 
spectively), the number of transitions was essen- 
tially equivalent to the number of transversions 


observed in both the та К data set for Saxifra- 


= 
FIGURE 6. 
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Summary of descriptive measures and indices of variation in our matK sequence matrices for Saxifra- 


Saxifragaceae s.s. Gilia 

1078 1083 

44 26 

25 16 
0.735 0.785 
0.583 0.688 
0.341 0.423 
0.792 0.871 
0.727 0.827 
1:1.09 1:1.05 
1:17:1:21.58 1.44:1:1.87 


gaceae s. s. and Gilia (Table 2). As expected, 
substitutions at the second codon position were less 
frequent than those at the first or third positions. 
However, the numbers of substitutions at the first 
and second codon positions were only slightly lower 
than the number observed at the third position in 
both the Saxifragaceae s. s. and the Gilia matK 
data sets (Tables 2 and 3). In contrast, їп rbcL 
the number of third position substitutions is much 
higher than the numbers of substitutions at the first 
and second codon positions, as is typical of protein- 
coding regions with strong functional constraints 
(Table 3; Donoghue et al., 1992; Smith et al., 
1993; see also Steele & Vilgalys, 1994). 

The randomization test of Archie (19892) in- 
dicates that variation in the Saxifragaceae s. s. and 
Gilia matK data sets is substantially non-randomly 
structured. The most parsimonious tree length (Fig. 
2; 842 steps) obtained from analysis of the matK 
sequences for Saxifragaceae s. s. is far removed 
from the range (1188 to 1213 steps) and mean 
(1200 steps) of the most parsimonious tree lengths 
derived from 100 random permutations of these 
data. Likewise, the most parsimonious tree length 
(418 steps) obtained from analysis of the matK 
sequences for Gilia is far removed from the range 
(838 to 859 steps) and mean (847 steps) of the 
most parsimonious tree lengths derived from 100 
random permutations of the Gilia data matrix. 
Because HER (Archie, 1989b) measures congru- 
ence among characters and departure from ran- 
domness (i.e., HER equals one when data are com- 


One of 36 most parsimonious trees from analysis of matK sequences for Gilia and related genera. 


Base substitutions (ACCTRAN) are indicated above branches. Bootstrap and decay values are indicated below branches, 
respectively. Dashed lines represent branches that are not supported by all most parsimonious trees; in these instances 
the decay value is zero and thus is not indicated following the bootstrap value below the branch. Letters (A-D) denote 
the distribution of specific indels referenced in Appendix 2. 
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Comparison of sequence variation between matK and rbcL for the same suite of 25 saxifragaceous 


taxa. Taxa included in this comparison are indicated in Appendix 1. 


Comparison 
Number of characters 
Percent of characters variable 
Percent of characters potentially informative 
Percent of amino acid positions variable 
Number of most parsimonious trees 
Consistency index (CI) 
CI excluding autapomorphies 
Retention index (RI) 
Transition : transversion ratio 
Codon position ratio 
RI by codon position 


pletely congruent and approaches zero as data 
approach randomness in the distribution of char- 
acter states), we use HER as an indicator of the 
degree of hierarchical structure present in our data 
matrices. Values of HER are 0.727 and 0.827 for 
our Saxifragaceae s. s. and Gilia matK data sets, 
respectively. These values indicate that homoplasy 
and departure from hierarchical structure in these 
matK data are approximately 27.3% (in Saxifra- 
gaceae s. s.) and 17.3% (in Gilia) of that present 
in randomly structured data sets possessing the 
same character-state distributions as our original 
matrices. 


DISCUSSION 
PHYLOGENETIC ANALYSES 


Saxifragaceae 5. 5. Several recent studies have 
presented phylogenetic trees based on rbcL se- 
quences and cpDNA restriction sites for members 
of Saxifragaceae s. s. In addition, the implication 
of these cpDNA-based phylogenetic trees regarding 
trends in chemical, morphological, and cytological 
evolution within the family have been discussed. 
Herein we focus on comparisons between previous 
phylogenetic trees based on rbcL sequences and 
cpDNA restriction site variation (Morgan & Soltis, 
1993; Soltis et al., 1993) and relationships sug- 
gested by matK sequences when analyzed both 
independently and combined with the other two 
data sets. 

Strong support for the monophyly of Saxifra- 
gaceae s. s. has been demonstrated by an extensive 
analysis of rbcL sequences representing a diverse 
array of dicots (Morgan & Soltis, 1993), as well 
as with rpl2 intron data (Downie et al., 1991). 
Parsimony analysis of 1078 bp of matK sequence 
for 45 taxa also reveals a well-supported Saxifra- 
gaceae s. s. (Fig. 2). The members of Saxifragaceae 


matK rbcL 

1078 1398 

38 12 

16 7 

59 5 

3 24 

0.790 0.712 
0.635 0.580 
0.736 0.699 
1:1.06 1.41:1 
1:22:15: 1.57 1.26:1:6.17 
0.76; 0.71; 0.73 0.57; 0.63; 0.75 


s. s. share seven unique base substitutions, are 
united in all trees found during 100 bootstrap rep- 
lications, and form a clade in all trees up to at 
least six steps longer than the most parsimonious 
trees (Fig. 2), relative to the two outgroup taxa 
chosen. 

Within Saxifragaceae s. s., several lineages and 
groups of genera (i.e., the Boykinia, Heuchera, 
and Leptarrhena/Tanakaea groups, and two lin- 
eages of Saxifraga) are well supported by matK 
sequences, whereas other groups of genera (i.e., 
the Darmera and Chrysosplenium/ Peltoboykinia 
groups) are less well supported but consistently 
united in all most parsimonious trees (Fig. 2). Al- 
though we have increased both the number of 
nucleotides and the number of taxa in the matK 
matrix by almost 50% over an earlier analysis 
(Johnson & Soltis, 1994), relationships among all 
of the major lineages noted above remain unre- 
solved. Low support for those branches uniting the 
major groups of genera was also evident in inde- 
pendent analyses of rbcL sequences and cpDNA 
restriction sites (Soltis et al., 1993) and in the 
combined analyses of all three cpDNA data sets 
(Figs. 3-5). Whereas the inability of these three 
cpDNA data sets to elucidate relationships among 
the major groups of genera in Saxifragaceae s. s. 
may seem unsatisfactory from both a cladistic and 
a taxonomic standpoint, it nonetheless is significant 
in that it supports the occurrence of a rapid ra- 
diation early in the evolutionary history of Saxi- 
fragaceae s. s., an event also suggested by other 
lines of evidence (e.g., host preferences of Puc- 
cinia rusts; Savile, 1975). 

In contrast to the lack of support among the 
major groups of genera, relationships within some 
of these groups are well supported by matK se- 
quences. Phylogenetic relationships among mem- 
bers of the Boykinia group are, for example, most- 
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ly well defined and strongly supported by bootstrap 
values of 100 and decay values of four or higher 
(Fig. 2). Significantly, the degree of support for 
generic- and species-level relationships within the 
Boykinia group based on matK sequences is com- 
parable to that obtained via analysis of cpDNA 
restriction sites (Soltis et al., 1993); these same 
strongly supported relationships are also seen in 
the analysis of the three combined cpDNA data 
sets (Fig. 5). Trees based on matK sequences (Fig. 
2) agree with those obtained from analysis of cpDNA 
restriction sites: in recognizing a strong relationship 
between Telesonix and the enigmatic Jepsonia; in 
suggesting that Sullivantia is sister to Bolandra, 
Boykinia, and Suksdorfia; and in indicating the 
polyphyly of Suksdorfia. 

Phylogenetic analysis of matK sequences not 
only indicates that Saxifraga is polyphyletic, com- 
prising two well-supported lineages, but also reveals 
considerable differentiation among species within 
each of these two lineages (Fig. 2). Saxifraga 
integrifolia, S. ferruginea, S. mertensiana, and 
S. punctata (— S. nelsoniana) have traditionally 
been placed in section Micranthes (Engler & 
Irmscher, 1916-1919). Gornall (1987), however, 
considered S. mertensiana so distinctive morpho- 
logically that he placed it in its own section, Het- 
erisia. Significantly, S. mertensiana is well re- 
moved from these three species of section 
Micranthes in all most parsimonious trees derived 
from analysis of matK sequences (Fig. 2) and dif- 
fers from these species by a minimum of 135 
nucleotide differences and 2 indels. The distant 
relationship of S. mertensiana to members of sec- 
tion Micranthes (Fig. 2) is also suggested by in- 
dependent analyses of rbcL sequences and cpDNA 
restriction sites (Soltis et al., 1993), as well as 
analyses of the combined cpDNA data sets (Figs. 
3—5). Additionally, the sister relationship suggested 
by analysis of matK between S. mertensiana and 
S. cernua/S. oppositifolia (of sections Saxifraga 
and Porphyrion, respectively) is also suggested by 
rbcL sequences (Soltis et al., 1993). Given the 
large number (35) of base substitutions uniting S. 
mertensiana with S. cernua/S. oppositifolia and 
an even larger number (52) of autapomorphies in 
S. mertensiana, we cannot discount the possibility 
that this relationship is an artifact produced by 
attraction of long branches during parsimony anal- 
ysis (Felsenstein, 1978). Nonetheless, the level of 
differentiation in matK observed among the small 
sample of Saxifraga species included herein (Fig. 
2; range of 44 to 162 base substitutions in pairwise 
comparisons) suggests that additional comparative 
matK sequencing within Saxifraga should be ex- 


tremely useful at defining lineages and relationships 
among many of the 300 species that are presently 
recognized in this morphologically diverse genus. 

A fourth well-supported, albeit small, lineage in 
Saxifragaceae s. s. comprises the sister genera Lep- 
tarrhena and Tanakaea (Fig. 2). Previous analyses 
of matK sequences (Johnson & Soltis, 1994) and 
rbcL sequences (Soltis et al., 1993), as well as the 
analysis of the three combined cpDNA data sets 
(Figs. 3-5), not only support this relationship, but 
also suggest that these two genera are sister to the 
Boykinia group. This relationship to the Boykinia 
group is generally not strongly supported, however, 
and is not revealed in all most parsimonious trees 
from analysis of our expanded matK data set (Fig. 
2), analysis of combined cpDNA data matrix-3 (Fig. 
5), or analysis of cpDNA restriction site data (Soltis 
et al., 1993). Matrix-1 of the combined cpDNA 
data sets (Fig. 3) provides the strongest support 
for a sister relationship between Leptarrhena/ 
Tanakaea and the Boykinia groups (91% boot- 
strap and decay value >6). However, the small 
number of taxa in this matrix does not include what 
may be critical taxa for defining the true affinities 
of Leptarrhena/ Tanakaea. For example, matrix-2 
of the combined cpDNA data sets includes just two 
additional genera, Astilbe and Chrysosplenium, 
yet support for Leptarrhena/ Tanakaea as sister 
to the Boykinia group declines to 71% bootstrap 
and a decay value of 3 in analyses of this data set 
(Fig. 4). 

The Heuchera group of genera is strongly sup- 
ported by matK sequences, and several lineages 
are differentiated within this group. As was ob- 
served in an earlier analysis of cpDNA restriction 
site data (Soltis et al., 1991), Неисйега and Mi- 
tella appear polyphyletic with species located in 
several independent lineages. Both cpDNA restric- 
tion sites and matK sequences also provide strong 
support for a clade comprising Bensoniella, Lith- 
ophragma, and Tolmiea. matK and cpDNA re- 
striction site data sets do not concur, however, in 
the affinities of Bensoniella, Lithophragma, and 
Tolmiea within the Heuchera group. Analysis of 
cpDNA restriction sites suggests that Bensoniella, 
Lithophragma, and Tolmiea are the sister to all 
other members of the Heuchera group. In contrast, 
analysis of matK sequences suggests that Benson- 
iella, Lithophragma, and Tolmiea are the sister 
to Conimitella, Elmera, Tellima, Tiarella, and 
most species of Heuchera and Mitella; all of these 
taxa then appear as the sister to Heuchera hir- 
sutissima, Mitella diphylla, and M. nuda (Fig. 
2). The latter three species are well differentiated 
from other species of Heuchera and Mitella in the 
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cpDNA restriction site study (Soltis et al., 1991). 
The unusual position of these three species in the 
Heuchera group suggested by matK sequences 
requires further investigation. 

Analysis of matK sequences provides support 
for Chrysosplenium and Peltoboykinia as sister 
genera (Fig. 2), a relationship that is also suggested 
by rbcL sequences but not by all shortest trees 
based on cpDNA restriction sites (Soltis et al., 
1993). Not only is the degree of bootstrap support 
for this relationship high (88%), but these two 
genera are also united in all trees up to 4 steps 
longer than the most parsimonious tree and receive 
similar support in analyses of the combined cpDNA 
data sets (Figs. 4, 5). The sister relationship be- 
tween these two genera is significant because both 
genera are morphologically distinctive and their 
affinities are problematic. Chrysosplenium is par- 
ticularly well defined morphologically and contains 
a large number of species. It is thus noteworthy 
that the branch uniting the two Chrysosplenium 
species is among the longest observed on the matK 
phylogenetic tree (Fig. 2), with 47 substitutions 
and two unique indels uniting these species. Com- 
parison of a few partial sequences from other spe- 
cies of Chrysosplenium (D. Soltis, unpublished) 
suggests that matK sequences will be useful for 
elucidating relationships within this genus. 

The Darmera group of genera is also recognized 
based on analysis of matK sequences, although this 
group is not as strongly supported (76% bootstrap, 
decay value of 2; Fig. 2) as are some of the other 
major groups of genera in Saxifragaceae s. s. The 
Darmera group is, however, well supported by 
cpDNA restriction sites (99% bootstrap, 26 steps 
of decay; Soltis et al., 1993) and also appears in 
our analyses of the three combined cpDNA data 
sets (Figs. 3-5). In contrast, rbcL sequences fail 
to recognize the Darmera group, although this 
appears to be the result of the small number of 
base substitutions in rbcL at this level of analysis 
rather than indicative of a strongly supported op- 
posing view of relationships (Soltis et al., 1993). 
As within other major groups of genera in Saxi- 
fragaceae, matK and cpDNA restriction sites again 
provide comparable pictures of relationships within 
the Darmera group. Both cpDNA restriction sites 
and matK sequences indicate that Bergenia and 
Mukdenia are sister taxa and that Astilboides, 
Darmera, and Rodgersia are closest relatives. 


Gilia. 


sequence for 20 polemoniaceous taxa, Gilia ap- 


In a recent analysis of 661 bp of matK 


peared monophyletic (Steele & Vilgalys, 1994); 
however, Steele and Vilgalys included only four 
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species of Gilia from two sections, Arachnion and 
Gilia. Our broader analysis of matK sequences 
from species representing all six sections of Gilia 
reveals that species of Gilia are scattered through- 
out the temperate radiation of Polemoniaceae (Fig. 
6). Species of Gilia appear in several well-separated 
lineages that also include members of virtually all 
of the other temperate genera included in this 
analysis (i.e., Allophyllum, Collomia, Eriastrum, 
Ipomopsis, Langloisia, Navarretia, Phlox, and 
Polemonium). Thus, our analysis clearly indicates 
that Gilia is polyphyletic. Although few of the 
species of Gilia included in the matK analysis were 
identical to those sampled in a recent ITS sequence 
analysis of Polemoniaceae (Fig. 8; Porter, 1993), 
the sectional coverage of Gilia in the two studies 
is comparable. Both matK and ITS phylogenetic 
trees agree in suggesting similar relationships among 
sections of Gilia and allied genera (compare Figs. 
6 and 8). Furthermore, relationships that are only 
weakly supported by matK sequence data, such 
as the branching patterns among basal nodes (Fig. 
6), are also only weakly supported by ITS sequenc- 
es (Fig. 8). 

Analyses of matK sequences strongly support a 
clade of “true gilias?” comprising sections Gilia, 
Arachnion, and the species G. scopulorum and G. 
stellata of section Saltugilia (Fig. 6). A close re- 
lationship among sections Arachnion, Gilia, and 
Saltugilia has been previously suggested (Grant, 
1954, 1959; Grant & Grant, 1956а, b). Further- 
more, although both G. scopulorum and G. stellata 
have been allied with the G. splendens group of 
section Saltugilia (Grant & Grant, 1954, 19565; 
Grant, 1959; Day, 1993b), the distinctness of the 
former species with regard to the G. splendens 
group has also been recognized (Grant & Grant, 
1954, 19565). ITS sequences also define a clade 
of true gilias similar in composition to that defined 
by matK sequences. However, in the ITS analysis 
G. stellata and G. scopulorum are united in a 
single clade as sister to both sections Gilia and 
Arachnion (Fig. 8), rather than placed apart as 
suggested by matK sequences (Fig. 6). 

Gilia tenerrima (section Giliastrum) appears as 
a weakly supported sister to the true gilias in anal- 
yses of both matK (Fig. 6) and ITS (Fig. 8) se- 
quences. The morphology of G. tenerrima is unique 
among gilias, and its affinities have been sought 
among species currently recognized in section Kel- 
loggia (e.g., Mason & Grant, 1948; Grant, 1959) 
or section Giliastrum (e.g., Grant & Grant, 1954; 
Day, 1993a). The polyphyly of section Giliastrum 
is indicated by matK sequences based on the in- 
clusion of only one other species of section Gi- 
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Strict consensus of two most parsimonious trees for Polemoniaceae resulting from parsimony analysis 


of nuclear rDNA ITS sequences (length — 1074; CI — 0.47). Poorly supported branches, as indicated by bootstrap, 
jackknife, decay, and skewness (Hillis & Huelsenbeck, 1992) analyses, are represented by broken lines. A few terminal 
branches have been collapsed to emphasize relationships discussed herein; in these cases, the value in parentheses to 
the right of the generic name represents the number of species that appear along the branch. Data analysis and tree 


courtesy of Porter (1993). 


liastrum, G. filiformis, in this analysis. Gilia fili- 
formis is strongly allied with a lineage including 
Phlox in the matK analysis (Fig. 6) and is well 
separated from G. tenerrima by 65 bases in a 
pairwise comparison of nucleotide differences. The 
polyphyly of section Giliastrum is also strongly 
suggested by ITS sequences (Fig. 8). 


A well-supported sister lineage to G. tenerrima 
and the clade of true gilias is a strongly supported 
group comprising Allophyllum, Collomia, Navar- 
retia, and species of Gilia section Kelloggia. These 
taxa share a 6-bp deletion and 10 unambiguous 
base substitutions on the matK phylogenetic tree 
(Fig. 6). Gilia capillaris and G. leptalea (section 
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Kelloggia; Mason & Grant, 1948; Day, 1993a) 
form a well-defined lineage within this group and 
are differentiated from their sister genera by eight 
unambiguous base substitutions. These two species 
have, until recently (Day, 1993a), been recognized 
as a distinct species group in section Saltugilia 
(Grant & Grant, 1954; Grant, 1959). Analyses of 
matK sequences thus support the recognition of 
section Kelloggia, although the close relationship 
of this section to Allophyllum, Collomia, and Ма- 
varretia has not been considered in recent clas- 
sifications. It is noteworthy, however, that Gray 
(1873) and Brand (1907) placed G. leptalea and 
G. sinistra, respectively, in Collomia (G. sinistra 
is also in section Kelloggia but was not included 
in this analysis of matK sequences). Species from 
section Kelloggia were not included in Porter's 
(1993) ITS analysis, and ITS sequences place Na- 
varretia, Collomia, and Allophyllum as progres- 
sively basal sisters to G. tenerrima and the true 
gilias rather than united as a single lineage as 
suggested by matK sequences (Fig. 6). This ap- 
parent discrepancy between the ITS and matK 
trees may simply be a result of inappropriate root- 
ing of the ITS sequence clade that includes Allo- 
phyllum and the true gilias due to the inclusion of 
very divergent sequences in the ITS matrix (M. 
Porter, pers. comm.). For example, the Alloph yl 
lum-true gilia clade in the ITS tree (Fig. 8) could 
be rerooted at the midpoint of the branch between 
G. tenerrima and Navarretia to provide essentially 
the same view of relationships depicted by the 
matK sequence tree (Fig. 6). 

The core members of section Saltugilia, С. 
splendens, G. australis, and G. caruifolia, were 
also not included in Porter's (1993) ITS sequence 
analysis but are strongly supported by matK data 
as a unique lineage well separated from all other 
lineages of Gilia sequenced to date. A close rela- 
tionship among these three species was first for- 
mally recognized by Grant & Grant (1954). How- 
ever, the great divergence between this lineage and 
the true gilia clade has not been previously con- 
sidered. We are currently further investigating the 
affinities of the С. splendens group because its 
placement as sister to the Allophyllum and true 
gilia clades in the matK analysis is only weakly 
supported (Fig. 6). 

Finally, this analysis of matK sequences rec- 
ognizes G. hutchinsifolia and G. leptomeria (both 
of section Giliandra) as a well-supported clade that 
is also well separated from other gilias on the most 
parsimonious tree (Fig. 6). A clade including G. 
hutchinsifolia is similarly well removed from other 
gilias in analyses of ITS sequences (Fig. 8). Fur- 


thermore, both matK and ITS sequences concur 
in placing G. hutchinsifolia and related species as 
sister to Eriastrum, Langloisia, and Ipomopsis 
(Fig. 6), although this relationship is not strongly 
supported in either analysis. 


The 


ability of matK sequences to reveal even fine-scale 


Polyploid origin of Saxifraga osloensis. 


relationships in taxonomic groups is illustrated by 
an example from the genus Saxifraga. Saxifraga 
osloensis is a tetraploid of evolutionary interest 
because it has been proposed by some as the only 
saxifrage that can confidently be considered an 
alloploid, although its parentage is controversial 
(reviewed in Webb & Gornall, 1989). Further- 
more, it is also considered to be a classic example 
of a species of postglacial allopolyploid origin. Kna- 
ben (1954) hypothesized that S. osloensis is de- 
rived from two closely related diploid species, S. 
tridactylites and S. adscendens. C. Brochmann 
(pers. comm.), in contrast, has suggested that 5. 
osloensis may be an autopolyploid derived from 
only 5. tridactylites. Webb & Gornall (1989) note, 
however, that morphologically, 5. osloensis more 
closely resembles S. adscendens. A comparison of 
only 750 bp of matK sequence shows that S. 
osloensis is identical in sequence to 5. adscendens, 
and the two species differ from S. tridactylites by 
11 nucleotide differences. Thus, either S. osloensis 
is an autoploid derived from S. adscendens, or if 
an alloploid, 5. adscendens was the maternal par- 


ent. 


VARIABILITY 


Indels. 


sets suggest that a matK sequence matrix with 


The Saxifragaceae s. s. and Gilia data 


enough taxonomic breadth to show informative base 
substitutions is also likely to contain indels. For 
example, the 45-taxon Saxifragaceae s. s. matK 
sequence matrix contains 10 indels (Fig. 2; Ap- 
pendix 2), the 31-taxon Gilia matK sequence ma- 
trix contains four indels (Fig. 6, Appendix 2), a 
76-1ахоп Apiaceae-Pittosporaceae matK se- 
quence matrix contains 12 indels (Plunkett, 1994), 
and a seven-taxon Cupressaceae matK sequence 
matrix contains two indels (P. Gadek and C. Quinn, 
pers. comm.). All of these indels are small, ranging 
in size from 3 to 9 bp. Because apparently identical 
indels may have multiple origins in unrelated taxa 
(Golenberg et al., 1993), we have not appended 
these indels as additional characters in the data 
matrices. However, only one of the 16 indels ob- 
served in the above matrices (indel E in the Sax- 
ifragaceae; Fig. 2, Appendix 2) appears homopla- 
sious when mapped on trees derived from analyses 


168 


Annals of the 
Missouri Botanical Garden 





of base substitutions alone. Furthermore, given the 
poor resolution in the placement of Astilbe and the 
possibility of long-branch attraction in the place- 
ment of Saxifraga mertensiana on the most par- 
simonious trees for Saxifragaceae s. s. (Fig. 2), 
even this indel may have had a single origin. Al- 
though it has been our experience with matK that 
indels can usually be aligned with a high degree of 
confidence if the reading frame is taken into con- 
sideration, we recognize that uncertainties in the 
alignment of indels is а potential source of error 
in analyses (Ritland & Clegg, 1987). The amount 
of error is likely to be minimal, however, given 
that few bases are involved in indels relative to the 
number of potentially informative characters that 
are unambiguously aligned. 


Base substitutions. Most base substitutions in 
the Saxifragaceae s. s. and Gilia matK data sets 
are hierarchically structured as evidenced by the 
randomization test of Archie (19892). Further- 
more, the homoplasy observed in these large matK 
data matrices is not only modest (Table 2), but is 
also evenly distributed on the most parsimonious 
trees (data not shown). Together with the strong 
bootstrap support and high decay values for many 
of the clades present in the phylogenetic trees for 
Saxifragaceae s. s. and Сша (Figs. 2 and 6, re- 
spectively), these observations suggest that both 
matrices have been constructed within an appro- 
priate range of taxonomic hierarchy. Thus, al- 
though a minimum of 85% (in Saxifragaceae s. s.) 
and 73% (in Gilia) of the potentially informative 
characters in these matK sequence matrices have 
experienced multiple hits (based on the number of 
potentially informative characters with 3 or 4 char- 
acter states), most of these multiple hits convey 
phylogenetic information. Whereas multiple sub- 
stitutions per site may be problematic in matK 
analyses at broad taxonomic levels, they do not 
appear to be unduly problematic in analyses of 
intergeneric and interspecific relationships. 


PCR AND SEQUENCING PRIMERS 


To stimulate the use of matK sequence data in 
other groups, we have tried to assess the broad 
applicability of the PCR and sequencing primers 
described herein. The trn K coding regions are very 
conserved among angiosperms, and PCR primers 
trnK-2R and trn K-3914F (Table 1) have been 
used by ourselves and others to amplify matK from 
dicots such as Annonaceae, Apiaceae, Asteraceae, 
Balsaminaceae, Brassicaceae, Cornaceae, Erica- 
ceae, Grossulariaceae, Lauraceae, Magnoliaceae, 
Malpighiaceae, Malvaceae, Polemoniaceae, Rham- 


naceae, Rosaceae, Sarraceniaceae and Saxifraga- 
ceae. These primers have also been used to amplify 
matK from monocots such as lridaceae, Junca- 
ceae, and Orchidaceae, although the latter family 
requires twice the standardly employed amount of 
MgCl, to achieve successful amplification via PCR 
(M. Chase, pers. comm.). For routine sequencing 
of monocots, however, it may be desirable to use 
Learn's original trnK-3914F primer (Table 1) that 
provides even greater homology to the 5' trnK 
exon in these plants. Outside of angiosperms, prim- 
ers trnK-3914F and trn K-2R have been used suc- 
cessfully to amplify matK from Cupressaceae (P. 
Gadek & C. Quinn, pers. comm.). The similarity 
of these amplification primers to published trnK 
sequences of Pinus (Lidholm & Gustafsson, 1991) 
is high for trnK-3914F, but somewhat lower for 
trnK-2R. It may therefore be desirable to modify 
trnK-2R for routine use in conifers. Both trnK- 
3914F and trn K-2R also exhibit high similarity to 
a published sequence for Marchantia (Umesono 
et al., 1988) and may be useful for amplifying 
matK from bryophytes as well. The PCR ampli- 
fication primers rps16-4547F and psbA-R are use- 
ful as alternatives to primers trnK-3914F and trnK- 
2R, respectively, in some circumstances, but these 
primers may not prove as widely applicable (Fig. 
l; Table 1). For example, rps16 (Fig. 1) is not 
present in Marchantia (Umesono et al., 1988), 
and psbA (Fig. 1) is duplicated between rps16 and 
the 5' trnK exon in at least some species of Pinus 
(Lidholm & Gustafsson, 1991). More recently, 
primers trnK-253F, trnK-7 10F, and trnK-2000R 
have been used in various combinations with the 
standard PCR primers described above to obtain 
shorter fragments that contain matK from taxa in 
Apicaceae (Plunkett, 1994) and Saxifraga that 
otherwise have yielded poor matK PCR products. 
For sequencing, primers matK-1235R, matK- 
1470R, and matK-1412F (Table 1) appear fairly 
conserved, at least among dicots, and enable the 
generation of sequences of approximately 1100 
contiguous bp of matK beginning at the 5' end. 
We have used these sequencing primers success- 
fully in Apiaceae, Brassicaceae, Cornaceae, Eri- 
caceae, Grossulariaceae, Polemoniaceae, and Sax- 
ifragaceae. Other sequencing primers are less 
conserved, have not been broadly tested, or, in the 
case of primer matK-11068R, the 3' nine nucleo- 
tides are duplicated within the annealing site of 
matK-1470R, and have annealed at both sites in 
some Saxifragaceae and Polemoniaceae, but not 
Apiaceae (C. Plunkett, pers. comm.). Of the se- 
quencing primers we have used in angiosperms, 
matK-1470R has also worked in the Cupressaceae 
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(P. Gadek & C. Quinn, pers. comm.) and thus 
appears to be a good choice for initial sequencing 
of matK in diverse plant groups. Given the rapid 
rate of matK sequence evolution, we have found 
that at least some primers will need to be designed 
specifically for a given group in order to sequence 
all of matK. For example, whereas matK-1168R, 
matK-1235R, and matK-1470R worked well in 
nearly all members of Saxifragaceae s. s. sequenced 
for this study, comparative sequencing within just 
the large genus Saxifraga has required the syn- 
thesis of three additional sequencing primers (trnK 
710F, matK-1176F, and matK-1412R; D. Soltis, 
unpublished). The need to design group specific 
sequencing primers is likely to be true of any gene 
as large as matK, however, that also evolves at a 
rate useful for intrafamilial and intrageneric phy- 
logenetic reconstruction. 
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APPENDIX l. Species included in the Gilia matK sequence matrix (referenced to Fig. 6), the Saxifragaceae s.s. 
matK sequence matrix (referenced to Fig. 2), the three Saxifragaceae s.s. matrices composed of combined cpDNA 
restriction sites (г. s.) and matK and rbcL sequences (таїгіх-1 is referenced to Fig. 3; matrix-2 is referenced to Fig. 
4; and matrix-3 is referenced to Fig. 5), and the comparison between rbcL and matK sequences for 25 species 
(referenced to Table 3). GenBank accession numbers are reported for all matK and rbcL sequences, whereas cpDNA 
restriction site data are located in Soltis et al. (1991) and Soltis et al. (1993). 











Family 
Species Voucher / citation Data type GenBank  Figure/Table 
GROSSULARIACEAE 
Ribes aureum Pursh Soltis & Soltis 2220, WS так L34153 2.3.4.573 
Morgan & Soltis, 1993 rbcL L11204 3,4, 5/3 
Soltis et al., 1993 r. S. — 3,4,5 
Tetracarpaea tasmanica Hook. f. Jordon s.n., HO matK L34154 2, 5/3 
Morgan & Soltis, 1993 rbcL L11207 5/3 
POLEMONIACEAE 
Allophyllum gilioides (Benth.) Grant & Johnson 92012, WS matK L34176 6 
Grant 
Bonplandia geminiflora Cav. Patterson s.n., WS matK L34179 6 
Cantua pyrifolia Juss. Patterson s.n., WS matK L34180 6 
Collomia linearis Nutt. Johnson 92045, WS matK L34188 6 
Eriastrum densifolium (Benth.) Mason Johnson 92090, WS matK L34184 6 
Gilia achilleifolia Benth. Schultz s.n., WS matK L34175 6 
Gilia angelensis Grant Johnson 92013, WS matK L34177 6 
Gilia australis (Mason & Grant) Grant & Johnson 92021, WS matK L34178 6 
Grant 
Gilia capillaris Kellogg Johnson 93104, WS matK L34181 6 
Gilia capitata Sims Johnson 92015, WS matK L34182 6 
Gilia caruifolia Abrams Johnson 93096, WS matK L34183 6 
Сша filiformis Gray Johnson 93015, WS matK L34185 6 
Gilia hutchinsifolia Rydb. Johnson 93069, WS matK L34186 6 
Gilia inconspicua (Smith) Sweet R. Johnson 149, WS matK L34187 6 
Gilia leptalea (Gray) Greene Patterson s.n., WS matK L34195 6 
Gilia leptantha Parish Schultz 52503, WS matK L34197 6 
Gilia leptomeria Gray Johnson 93008, WS matK L34196 6 
Gilia ochroleuca Jones Johnson 92022, WS matK L34189 6 
Gilia scopulorum Jones R. Johnson 304, WS matK L34190 6 
Gilia sinuata Benth. Johnson 92004, WS matK L34198 6 
Gilia splendens Dougl. ex Lindl. Johnson 92093, WS matK L34191 6 
Gilia stellata Heller Johnson 93059, WS matK L34199 6 
Gilia tenerrima Gray Johnson 93103, WS matK L34192 6 
Ipomopsis aggregata (Pursh) Grant Johnson 92100, WS matK L34193 6 
Ipomopsis congesta (Hook.) Grant R. Johnson 166, WS matK L34200 6 
Ipomopsis polycladon (Torrey) Grant Johnson 93068, WS matK L34194 6 
Langloisia setosissima (Torr. & Gray) Johnson 93074, WS matK L34201 6 
Greene 
Navarretia intertexta (Benth.) Hook. Glazner 9349, WS matK L34202 6 
Phlox gracilis Greene Johnson 92046, WS matK L34203 6 
Phlox hoodii Richardson Johnson 92001, WS matK L34205 6 
Polemonium californicum Eastw. Johnson 93089, WS matK L34204 6 
SAXIFRAGACEAE 
Astilbe japonica x Chinesensis Johnson s.n., WS matK L34114 2,4 
Astilbe taquetii (Léveillé) Koidzumi Morgan & Soltis, 1993 rbcL L11173 4, 5 
Soltis et al., 1990 T8: — 4, 5 
Astilboides tabularis (Hemsl.) Engl. Univ. Oslo Bot. Gard., matK L34115 2,3,4,5/3 
Norway, O 
Soltis et al., 1993 rbcL U06207 3, 4, 5/3 
Soltis et al., 1993 г. s. — 3,4,5 
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APPENDIX 1. Continued. 


Family 


Species 


Bensoniella oregona (Abrams & Bacig.) 


Morton 


Bergenia cordifolia (Haw.) A. Br. 


Bolandra oregana Wats. 


Boykinia occidentalis Torr. & Gray 
Boykinia rotundifolia Parry 
Chrysosplenium americanum Schw. 
Chrysosplenium iowense Rydb. 
Chrysosplenium tetrandrum (Lund) Fries 
Conimitella williamsii (Eaton) Rydb. 
Darmera peltata (Torr.) Voss 


Elmera racemosa (Wats.) Rydb. 


Heuchera hirsutissima Rosend., Butt & 


Lak. 


Heuchera micrantha Dougl. 


Heuchera rubescens Torr. 


Jepsonia parryi (Torr.) Small 


Leptarrhena pyrolifolia (D. Don) R. Br. 


Lithophragma affine Gray 


Mitella caulescens Nutt. 
Mitella diphylla L. 
Mitella diversifolia Greene 


Mitella nuda L. 


Mitella pentandra Hook. 


Mitella stauropetala Piper 


Johnson & Soltis 
Phylogenetic Inference Using 
matK Sequences 


Voucher / citation 


Soltis & Soltis, s.n., WS 

Soltis & Soltis, s.n., WS 

Soltis et al., 1991 

Komarov Bot. Inst., 
Leningrad, Russia, LE 

Soltis et al., 1993 

Soltis et al., 1993 

Grable 11587, WS 

Soltis et al., 1993 

Soltis et al., 1993 

Grable 11636, WS 

Soltis et al., 1993 

Gornal 0101, UBC 

Morgan & Soltis, 1993 

Soltis et al., 1993 

Soltis et al., 1993 

Wendel s.n., ISC 

Johnson & Soltis, 1994 

Straly 6205, UBC 

Soltis & Soltis 1608, WS 

Soltis et al., 1991 

Soltis & Soltis 2083, WS 

Morgan & Soltis, 1993 

Soltis et al., 1993 

Soltis & Soltis 2234, WS 

Soltis et al., 1993 

Soltis et al., 1991 

Wallace s.n., RSA 

Soltis et al., 1991 

Soltis & Soltis 1949, WS 

Soltis et al., 1990 

Soltis et al., 1991 

Soltis et al., 1991 

Soltis et al., 1991 

Rieseberg 1110, RSA 

Soltis et al., 1993 

Soltis et al., 1993 

Soltis & Soltis 2237, WS 

Morgan & Soltis, 1993 

Soltis et al., 1993 

Pellmyr & Thompson 
s.n., WS 

Soltis et al., 1991 

Soltis & Soltis 1881, WS 

Soltis et al., 1991 

Soltis & Soltis 1857, WS 

Soltis et al., 1991 

Soltis & Soltis 1910, WS 

Soltis et al., 1991 

Johnson & Brunsfeld 
1908, WS 

Soltis et al., 1991 

Grable 11432, WS 

Soltis et al., 1991 

Soltis & Soltis 1856, WS 

Soltis et al., 1991 


matK 
rbcL 
Tass: 
matK 


rbcL 
r; S. 
matK 
rbcL 
ER 
matK 
[..5- 
matK 
rbcL 
ps 
rts: 
matK 
rbcL 
matK 
matK 
1.8, 
matK 
rbcL 
r. S. 
matK 
гђе, 
T.S- 
matK 
E 
так 
rbcL 
[x 
matK 
RS: 
matK 
rbcL 
E. 
matK 
rbcL 
p.t 


matK 


I 
matK 
Ps 
matK 
I5. 
matK 
E 
matK 


dE 
matK 
P 

matK 


r. S. 


Data type GenBank 


L34112 
L34072 


L34116 


U06208 
L34117 
006209 


L34118 
L34119 
1175 


L34120 
L19935 
L34121 

L34122 
L34123 
L11180 
L34124 
U06210 


L34125 


L34126 
L01925 


L34127 


L34128 
U06211 


L34129 
1.1119] 


L34130 


L34131 


L34132 


L34133 


L34134 


L34135 


L34136 
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APPENDIX l. Continued. 


Family 
Species Voucher / citation Data type GenBank — Figure/Table 
Mukdenia rosii (Oliver) Koidzumi Soltis s.n., WS matK L34137 2, 3, 4, 5/3 
Soltis et al., 1993 rbcL U06212 3, 4, 5/3 
Soltis et al., 1993 та — 3,4,5 
Peltoboykinia tellimoides (Maxim.) Hara Soltis s.n., WS matK L34138 2.3.4573 
Soltis et al., 1993 rbcL U06213 3, 4, 5/3 
Soltis et а|., 1993 Y.:8, — 3, 4,5 
Rodgersia pinnata Franch. Soltis s.n., WS matK L34139 2,3,4,5/3 
Soltis et al., 1993 rbcL U06214 3,4,5/3 
Soltis et al., 1993 r. s. — 3,4,5 
Saxifraga cernua L. Soltis et al., 1993 matK L34140 2, 5/3 
Soltis et al., 1993 rbcL U06215 5/3 
Saxifraga ferruginea Grah. Soltis et al., 1993 matK L34141 2,5 
Soltis et al., 1993 rS. — 5 
Saxifraga integrifolia Hook. Soltis & Soltis 2253, WS так L20131 2, 5/3 
Morgan & Soltis, 1993 rbcL L01953 5/3 
Saxifraga mertensiana Bong. Grable 11586, WS matK L34142 2, 3, 4, 5/3 
Soltis et а|., 1993 rbcL U06216 3,4,5/3 
Soltis et al., 1993 Г. 8. -- 3, 4, 5 
Saxifraga oppositifolia L. Soltis et al., 1993 matK L34143 2, 5/3 
Soltis et al., 1993 rbcL U06217 5/3 
Saxifraga punctata L. [= S. nelsoniana Soltis et al., 1993 matK L34144 2.3 4.573 
D. Don] Soltis et al., 1993 rbcL U06218 3,4,5/3 
Soltis et al., 1993 г. 8. — 3, 4,5 
Suksdorfia ranunculifolia (Hook.) Engl. Soltis & Soltis 2308, WS так L34145 2,95 
Soltis et al., 1993 NS — 5 
Suksdorfia violacea А. Gray Soltis & Soltis 2309, WS так L34146 2.5 
Soltis et al., 1993 НЕ 5 
Sullivantia oregana Wats. Soltis et al., 1993 matK L34113 2,3,4,5/3 
Soltis et al., 1993 rbcL U06219 3,4,5/3 
Soltis et al., 1993 г. 8. — 3, 4,5 
Sullivantia sullivantii (Torr. & Gray) Quackenbush s.n., WS matK L20130 2 
Britt. 
Tanakaea radicans Franch. Univ. Brit. Columbia Bot. matK L34147 2,3,4,5/3 
Gard., UBC 
Soltis et al., 1993 rbcL U06220 3, 4, 5/3 
Soltis et al., 1993 r. S. -- 3,4,5 
Telesonix heucheriformis Rydb. Wolf 151, WS matK L34148 2,3,4,5/3 
Soltis et al., 1993 rbcL U06221 3, 4, 5/3 
Soltis et al., 1993 [B — 3,4,5 
Tellima grandiflora (Pursh) Dougl. Soltis & Soltis 2113, УЗ matK L34149 2,5 
[= "northern" type] Soltis et al., 1993 & В; — 5 
Tellima grandiflora (Pursh) Dougl. Soltis & Soltis 2119, 5 matK L34150 2,3,4,5/3 
[= "southern" type] Soltis et al., 1993 rbcL U06222 3, 4, 5/3 
Soltis et al., 1991 r. S. -— 3,4,5 
Tiarella trifoliata L. Ness 533, WS matK L34151 2.5 
Soltis et al., 1991 D. — 5 
Tolmiea menziesii (Pursh) Torr. & Gray Soltis & Soltis 1903, WS matK L34152 2, 3, 4, 5/3 
Soltis et al., 1993 rbcL U06223 3, 4, 5/3 
Soltis et al., 1991 I.S, — 3,4,5 
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APPENDIX 2. Insertion/deletion events (indels) observed in matK sequence matrices following alignment. Indels 
are labeled alphabetically (A-D for СШа and A-J for Saxifragaceae); these labels correspond to those mapped on 
the most parsimonious trees (Fig. 6 for Gilia and Fig. 2 for Saxifragaceae). Dashes represent missing bases associated 
with indels. Dots in the sequence below the reference taxon (Gilia stellata for Gilia and Sullivantia oregana for 
Saxifragaceae) indicate that the same nucleotide present in the reference taxon is also present in the species containing 
the indel (note that all species in which an indel occurs are listed, although the sequence is given for only one of 
these species when more than one species possesses the same indel). P/A represents the number of species with 
nucleotides involved in indels present and absent, respectively. RN, the reference nucleotide, is the position of the 


last sequential nucleotide preceding the indel in unaligned sequences of the reference taxon for each data set. 











Data set 
Indel P/A Representative sp. RN Sequence region 
Gilia 
A 30A Са stellata 270 СААААТСАСАТТ TTTGGGCACAAC ------ ACGAAT TTGTATTATCAA 
LEY PORE RIOR = ase. oYueRTYERETES ААСААС. 2:21: iiis ба 
В ЗОР Са stellata 378 САСТТАСТСААА TCTCATAATTTA CGATCAATTCAT TCAATATTTCCT 
ІА Bonplandia geminifl. 7 ES бен vt cec Eee. Ste Paling Maree. ied au Рита 
C 26P Сша stellata 573 САССААТАТССТ AATTGGAATAAT ATTATTACTACA AAAAAATCTAGT 
ЗА Cillia leptalea — — ....... буз Leaded = SEGA e А a поль Сез C 
Gilia capillaris 
Allophyllum gilioides 
Collomia linearis 
Navarretia intertexta 
D 30A Сша stellata 198  GAAGTATTTATT AAAGATCTTCAA ---GATATTCTA GGTTTGTTCAAG 
LP ТШШ БИ ЦОЙ. агас и, eee TCGTCG...... MN CORE. A 
Saxifragaceae 
A 44P Sullivantia oregana 105 TATGCACTTGCT CATGATCATGTT TTAAATAGATCC ATTTTGTTGGAT 
LA Летасатрава tasmani 1. ваља Т........——— ——————..... е БИ УЕ ИЕ А 
В 43A Sullivantia oregana 141 ATTTTGTTGGAT AATTTTGGT--- TATGACAATAAA TCCAGTTCAATA 
2P Chrysosplenium iowe. 3 s. 0067 ТРУ Bis ooh. GUT aT. natu а C.. 
Chrysosplenium teta. 
C 44P Sullivantia oregana 237 ATTTCTGCTAAT TATTCGAACAAA AATAAATTTTTG GGGCATAACAAG 
LA Suksdorfia ranuncul. ее. б........... ————————— EGO ла Ва 
р 42А Sullivantia oregana 276 AAGAATTTCGAT TCTCAA--—--- ATGATATCAGAG GGATTTGCAGTT 
ЗР Saxifraga integrifo. — ........ A se у ико VANTABN, клона C 
Saxifraga punctata 
Saxifraga ferrugine. 
E 42А Sullivantia oregana 354 TCTTCCTTAGAA AGGAAAGAA--- ATAGTAAAATCT CATAATTTACGA 
ЗР Saxifraga сетиа — — — ...... аль САА за а Ен Me Sonet 
Saxifraga oppositif. 
Astilbe jap. х chin. 
F ААР Sullivantia oregana 366 GAAAGGAAAGAA ATAGTAAAATCT ———--—- CATAAT TTACGATCAATT 
ІА Saxifraga mertensia. pagau Ra unas E (s stra О СААААТ. „Ва. dem va C as 
G 40P Sullivantia oregana 366 GAAAGGAAAGAA ATAGTAAAATCT CATAATTTACGA TCAATTCATTCA 
5А Heuchera micrantha MÁS Lau WS: Siew fh ws а» Gee. шл Eris as, ee, 
Conimitella willia. 
Mitella diversifol. 
Mitella stauropeta. 
Tellima grandif. S. 
H 43A Sullivantia oregana 549 TTTCTCTATGAG TAT------ CAG AGTTGGAATAGT CTTATTACCCCA 
2P Chrysosplenium лоше.  ..... TA sm. па + ТАТАТТЕ G А ФА Са яриан 
Chrysosplenium teta. 
I ААР Sullivantia oregana 573 | AGTTGGAATAGT CTTATTACCCCA ACTCCAAAGAAA TCCATTTCCATT 
ІА Tetracarpaea tasman. Ах desig. ъй т у каг» ANS di quee в раа аа њен ин Шз» 
J 43Р Sullivantia oregana 591 CCAACTCCAAAG AAATCCATTTCC ATTGTTTCACAA AGGAATCAAAGA 
ZA Leptarrhena руга „Аара. Tau gee SSE Ruse adios COSME аа E ЈЕ 


Tanakaea radicans 


